Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux

Authors

  • Lukasz Golab
  • Howard J. Karloff
  • Flip Korn
  • Divesh Srivastava
Abstract

We present Data Auditor, a tool for exploring data quality and data semantics. Given a rule or an integrity constraint and a target relation, Data Auditor computes pattern tableaux, which concisely summarize subsets of the relation that (mostly) satisfy or (mostly) fail the constraint. This paper describes 1) the architecture and user interface of Data Auditor, 2) the supported constraints for testing data consistency and completeness, 3) the heuristics used by Data Auditor to “tune” a given constraint or its associated parameters for better fit with the data, and 4) several demonstration scenarios using real data sets.
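To make the idea of a pattern tableau concrete, the following is a minimal sketch, not Data Auditor's actual algorithm or interface. It assumes a pattern is a tuple of attribute values in which "-" is a wildcard, that a pattern's confidence is the fraction of matching tuples that satisfy a per-tuple rule, and that patterns are chosen greedily until a coverage target is met. The function names, the parameters `min_conf` and `min_cover`, and the toy relation are all hypothetical.

```python
from itertools import product

WILDCARD = "-"  # assumed notation for "any value"


def matches(pattern, row, attrs):
    """True if the row agrees with the pattern on every non-wildcard attribute."""
    return all(p == WILDCARD or row[a] == p for p, a in zip(pattern, attrs))


def candidate_patterns(rows, attrs):
    """All patterns formed by replacing any subset of a row's values with wildcards."""
    patterns = set()
    for row in rows:
        patterns.update(product(*[(row[a], WILDCARD) for a in attrs]))
    return patterns


def build_tableau(rows, attrs, rule, min_conf, min_cover):
    """Greedy sketch: pick patterns with confidence >= min_conf until
    at least min_cover (a fraction) of the rows is covered, if possible."""
    stats = []
    for pat in candidate_patterns(rows, attrs):
        idx = {i for i, r in enumerate(rows) if matches(pat, r, attrs)}
        conf = sum(1 for i in idx if rule(rows[i])) / len(idx)
        if conf >= min_conf:
            stats.append((pat, idx, conf))

    tableau, covered = [], set()
    target = min_cover * len(rows)
    while len(covered) < target and stats:
        # Prefer the pattern that covers the most not-yet-covered rows.
        pat, idx, conf = max(stats, key=lambda s: (len(s[1] - covered), s[0]))
        if not idx - covered:
            break  # no remaining pattern adds coverage
        tableau.append((pat, conf))
        covered |= idx
    return tableau


# Toy relation (made up for illustration): the rule is "owner is recorded".
rows = [
    {"region": "east", "type": "router", "owner": "ops"},
    {"region": "east", "type": "switch", "owner": "ops"},
    {"region": "east", "type": "router", "owner": "net"},
    {"region": "west", "type": "router", "owner": None},
    {"region": "west", "type": "switch", "owner": None},
    {"region": "west", "type": "switch", "owner": "net"},
]
attrs = ["region", "type"]

# Subsets that satisfy the rule, and subsets that mostly fail it.
hold_tableau = build_tableau(rows, attrs, lambda r: r["owner"] is not None, 1.0, 0.5)
fail_tableau = build_tableau(rows, attrs, lambda r: r["owner"] is None, 0.6, 0.5)
print(hold_tableau)
print(fail_tableau)
```

On this toy relation, the sketch summarizes the tuples that satisfy the rule with the single pattern (east, -) and the tuples that mostly fail it with (west, -), which is the kind of concise summary the abstract describes. Enumerating every candidate pattern is exponential in the number of attributes, so this approach is only suitable for illustration.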


Similar resources

Efficient and Effective Analysis of Data Quality using Pattern Tableaux

Data Auditor is a system for analyzing data quality via exploring data semantics. Given a user-supplied constraint, such as a functional dependency or an inclusion dependency, the system computes pattern tableaux, which are concise summaries of subsets of the data that satisfy (or fail) the constraint. The engine of Data Auditor is an efficient algorithm for finding these patterns, which defers...


Discovering Pattern Tableaux for Data Quality Analysis: a Case Study

In this paper, we present a case study that illustrates the utility of pattern tableau discovery for data quality analysis. Given a user-supplied integrity constraint, such as a boolean predicate expected to be satisfied by every tuple, a functional dependency, or an inclusion dependency, a pattern tableau is a concise summary of subsets of the data that satisfy or fail the constraint. We descri...


Surplus Free Cash Flow and Earnings Management: The Moderating Role of Auditor Size

This study examines whether surplus free cash flow is correlated with earnings management and whether auditor size moderates this relationship. To do so, the modified Jones discretionary accrual model (1995) is used to measure earnings management, and audit firm size is used as the audit quality indicator. The research hypotheses are built upon a sample of 103 companies listed on the Tehran Stock Exchange d...


The Impact of Dual Role of Forensic Accountant-Audit Firm's Partner on Audit Quality

Many factors that are not part of the audit requirements affect audit quality. One of them is the personal characteristics of the auditor, such as skills and expertise. Forensic accountants have skills, gained through specialized fraud courses, that non-forensic accountants lack. To investigate the impact of the dual role of forensic accountant-official accountant and audi...


Providing a Model for Assessment Internal Control Quality Based on the Characteristics of the Entity, the Characteristics of Auditor and Their Expected Goals in the Firm's Listed in Tehran Stock Exchange

According to domestic studies conducted in the field of internal controls, the lack of models for identifying weak internal controls is clearly felt. The present study aims to provide a model for assessing the quality of internal controls based on the characteristics of the economic unit, the characteristics of the auditor, and their expected goals in the Firm's liste...



Journal title:
  • PVLDB

Volume 3  Issue 

Pages  -

Publication date 2010